Large-Scale Discrete Fourier Transform on TPUs
نویسندگان
چکیده
In this work, we present two parallel algorithms for the large-scale discrete Fourier transform (DFT) on Tensor Processing Unit (TPU) clusters. The are associated with DFT formulations: one formulation, denoted as KDFT, is based Kronecker product; other famous Cooley-Tukey algorithm and phase adjustment, FFT. Both KDFT FFT formulations take full advantage of TPU's strength in matrix multiplications. formulation allows direct use nonuniform inputs without additional step. algorithms, same strategy data decomposition applied to input data. Through decomposition, dense multiplications kept local within TPU cores, which can be performed completely parallel. communication among cores achieved through one-shuffle scheme both sending receiving takes place simultaneously between neighboring along direction interconnect network. designed topology clusters, minimizing time required by cores. implemented TensorFlow. three-dimensional complex an example dimension 8192 ×8192 a Pod: run 12.66 seconds that 8.3 seconds. Scaling analysis provided demonstrate high efficiency implementations TPUs.
منابع مشابه
Discrete Fourier Transform on Multicores
This paper gives an overview on the techniques needed to implement the discrete Fourier transform (DFT) efficiently on current multicore systems. The focus is on Intel compatible multicores but we also discuss the IBM Cell, and briefly, graphics processing units (GPUs). The performance optimization is broken down into three key challenges: parallelization, vectorization, and memory hierarchy op...
متن کاملDft : Discrete Fourier Transform
A. Table of contents by sections: 1. Abstract (you’re reading this now) 2. Summary of the DFT (How do I do the homework?) 3. Review of continuous-time Fourier series 4. Bandlimited signals and finite Fourier series 5. Sampling theorem for periodic signals 6. Review of quirks of discrete-time frequency 7. Orthogonality and its significance 8. Discrete Fourier Transform (DFT) 9. Use of DFT to com...
متن کاملThe Discrete Fourier Transform∗
1 Motivation We want to numerically approximate coefficients in a Fourier series. The first step is to see how the trapezoidal rule applies when numerically computing the integral (2π) −1 2π 0 F (t)dt, where F (t) is a continuous, 2π-periodic function. Applying the trapezoidal rule with the stepsize taken to be h = 2π/n for some integer n ≥ 1 results in (2π) −1 2π 0 F (t)dt ≈ 1 n n−1 j=0 Y j , ...
متن کاملThe Discrete Fourier Transform
Disclaimer: These notes are intended to be an accessible introduction to the subject, with no pretense at completeness. In general, you can find more thorough discussions in Oppenheim's book. Please let me know if you find any typos. In this lecture, we discuss the Discrete Fourier Transform (DFT), which is a fourier representation for finite length signals. The main practical importance of thi...
متن کاملDiscrete Fourier Transform
This note provides a brief review of the Fourier transform for the analysis of discretetime signals and systems and a description of practical assignments, which will be performed on a Texas Instrument DSP board. Some tasks are to be performed with the help of MATLAB. It is assumed that the student has enough theoretical background to perform the practical part of the work. The review of some a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2021
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2021.3092312